Online traffic state estimation based on 
floating car data 

Arne Resting^ and Martin Treiber^ 



Abstract B 

esides the traditional data collection by stationary detectors, recent advances 
in wireless and sensor technologies have promoted new potentials for a vehicle- 
based data collection and local dissemination of information. By means of mi- 
croscopic trafhc simulations we study the problem of online estimation of the 
current traffic situation based on floating car data. Our focus is on the estima- 
tion on the up- and downstream jam fronts determining the extension of traffic 
congestion. We study the impact of delayed information transmission by short- 
range communication via wireless LAN in contrast to instantaneous information 
transmission to the roadside units by means of mobile radio. The delayed in- 
formation transmission leads to systematic estimation errors which cannot be 
compensated for by a higher percentage of probe vehicles. Additional flow mea- 
surements from stationary detectors allow for a model-based prediction which is 
effective for much lower floating car percentages than 1 %. 



1 Introduction 

Detailed and reliable online traffic state estimation is a prerequisite for advanced 
traffic information systems and the next generation of driver assistance sys- 
tems [1, 2]. Recent advances in wireless and sensor technologies have therefore 
promoted new potentials for a vehicle-infrastructure integration (VII) system 
which allows for a wireless communication between roadside sensors and vehicles 
equipped with communication interfaces [3, 4]. Data gathered from equipped ve- 
hicles ('probe vehicles' or 'floating cars') can be used to assess and predict traffic 
conditions and, in turn, information about incidents, travel times or congestion 
can be communicated from roadside units to the vehicles [5, 6, 7, 8]. 
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In this paper, we consider the following setup for an online trafRc-surveillance 
application based on floating car data: Probe vehicles collect their positions 
and speeds over time and communicate this information to roadside units either 
periodically by instantaneous forwarding by mobile phone communication, or, 
alternatively, by short-range communication when passing the location of a road- 
side unit. In any case, the roadside units are connected to each other and collect 
the data periodically for an online traffic state estimation. More specifically, we 
consider the prediction of the upstream jam front which, for example, can be 
used to warn the driver when congestion is ahead. For this purpose, we study 
the data collection, the communication to roadside units, and the traffic state 
estimation 'in-the-loop' by means of microscopic traffic simulations as a function 
of the percentage of equipped vehicles. This approach enables us to compare the 
estimates of both communication modes to the - known and reproducible - traf- 
fic situation. Since the expected equipment levels in the first deployment phase 
will be small, we consider small percentages of probe vehicles below 3 % only. 

The paper is structured as follows: In Sec. 2, the microscopic simulation set- 
up is described. In Sec. 3, the algorithm for the estimation of jam fronts is 
presented and a measure for evaluating the estimation quality is defined. In 
Sec. 4 an alternative model-based approach is presented which uses additionally 
flow measurements from stationary detector. We close with a discussion in Sec. 5. 



2 Microscopic Simulation Setup 

2.1 Reference Scenario with Empirical Boundary 
Conditions 

For the following case study of online estimation of jam fronts one needs a suited 
and realistic traffic situation serving as reference scenario. For this purpose we 
start with the empirical trafiic jam observed on the German freeway A5 shown 
in Fig. l(left). In the simulation environment, we consider a road-section with 
three lanes and a fiow-conserving bottleneck [9] . The empirical traffic flows and 
truck percentages from one detector location (shown in Fig. 2) serve as upstream 
boundary conditions assuring realistic traffic volumes and degrees of trafiic het- 
erogeneity. The simulator uses the; Intelligent Driver Model [9] as a simple, yet 
realistic, car-following model, and the general-purpose lane-changing algorithm 
MOBIL [10]. The parameters of the car- following model (see Table 1) and the 
bottleneck strength (modeled by a local increase of the time gap parameter T 
by 40 % in a given time interval) have been adapted in order to reproduce the 
empirically observed spatiotemporal congestion pattern on a semi-quantitative 
level. Figure 1 (right) shows the simulation result. 
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Fig. 1 Empirical traffic state from the German freeway A5 reconstructed from stationary loop 
detector data with an dedicated interpolation method [11] (left) and spatiotemporal traffic 
dynamics in a calibrated multi-lane simulation (right). 
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Fig. 2 Timeseries of traffic flow and truck fraction from the German freeway A5 used as 
upstream boundary conditions in the simulation. 



Parameter 


Car 


Truck 


Desired speed vq 


150 km/h 


90 km/h 


Desired time gap T 


1.15s 


2.1s 


Jam distance sq 


2.5 m 


4.0m 


Gap parameter si 


3.0m 


3.0m 


Max. acceleration a 


1.0m/s2 


0.8m/s2 


Comfortable deceleration b 


2.0 m/s^ 


2.0m/s2 



Table 1 Model parameters of the Intelligent Driver Model (IDM) [9] used in the reference 
scenario shown in Fig. 1. The vehicle length is 8 m for cars and 15 m for trucks. 



2.2 Floating Car Samples 

In the microscopic simulation, a given percentage of vehicles is randomly selected 
to generate floating-car data. These vehicles record their positions and speeds 
with a period of 5 s. The random seed has been fixed in order to assure identical 
samples of floating cars (FC). In total, 13 394 vehicles have been simulated in the 
three-lane scenario over 3.5 h. 137 vehicles (63) have been selected for a given 
FC percentage of 1 % (0.5%), i.e., the sample realized by the seed effectively 
contain 1.02% and 0.47%, respectively. This corresponds to an average time 
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gap of 1.5 min (3.3 min). Figure 3 shows the trajectories of these FC. In contrast 
to stationary detector data which are typically provided once per minute, time 
gaps between floating car are distributed randomly. For a constant traffic flow, 
these gaps can be described approximately by an exponential distribution [12]. 




Fig. 3 Random samples of floating cars for equipment levels of 1 % (left) and 0.5% (right). 



2.3 Modes of Vehicle-to- Infrastructure Communication 

For transmitting the floating car data to the roadside units (RSU) we simulate 
the following operation modes which can be considered as the limiting cases of 
delay times in an idealized communication: 

• Local transmission from the probe vehicle to the RSU: All infor- 
mation is broadcasted when passing the cross-section. This corresponds to a 
short-range communication, e.g. by wireless communication interfaces with 
a minimal broadcast range (WLAN). 

• Instantaneous transmission: After collection, data is directly transmitted 
to the RSU. This corresponds to a nonlocal communication, e.g. by using 
the cellular phone network (GSM). 

In the following section, we study the impact of delayed information trans- 
mission by short-range communication in contrast to instantaneous information 
transmission. Obviously, the time delay in the former case depends on the num- 
ber of RSU and their positions. Moreover, the delay will be determined by the 
local traffic conditions. For example, a severe traffic jam will have a significant 
impact on the up-to-dateness of incoming FCD. 



3 Online Estimation of Jam Fronts 

3 Online Estimation of Jam Fronts 
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After having defined the simulation setup, we now turn to the problem of using 
the currently available information at the RSU for predicting the positions of the 
jam fronts in an online application. Reliable and up-to-date information about 
these 'shock fronts' can then be used for applications in safety, driver information 
or traffic-adaptive cruise control systems [2]. 



3. 1 Extrapolation of Jam Fronts by Weighted Linear 
Regression 

When a floating car passes through the traffic jam, the first drop in its speed 
measurements below a certain threshold, e.g. 50km/h, defines one 'spatiotem- 
poral' point (t^, Xi) of the upstream jam front. After having gathered a sufficient 
number n of points (the last point n at time the position of the jam front 
at times t > t„ can be predicted by a linear extrapolation which is based on a 
weighted linear regression. In order to favor newer data points over older ones, 
we use for the weights an exponential 

if(<-iO = e-^(*-*'). (1) 

In the simulation, this 'online' extrapolation is carried out for both communica- 
tion modes. Examples of the estimated jam fronts are shown in Fig. 4. 




Fig. 4 Examples of extrapolated up- and downstream jam fronts with weighted regression 
using A = 0.33 min~^. The 'ground truth' traffic dynamics is shown for comparison. The 
delayed transmission by short-range wireless communication results in a less precise estimation 
of the upstream jam front (right) than in the case of instantaneous transmission (left). 
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3.2 Evaluation of the Online Estimation Results 



The prediction of the downstream jam front is relatively easy since this front is 
either stationary (for activated bottlenecks) or propagates with a fixed velocity 
of -15km/h against the direction of travel (cf. the traffic dynamics depicted 
in Fig. 1). Thus, we focus on the estimation of the upstream jam front a:"^ 
which does not have a characteristic propagation velocity but depends on the 
prevailing in- and outflows. As error quantity, we consider the deviation between 
the estimation x"^ derived from Eq. (1) and the simulation a;^^ (defined by a 
speed threshold of 50km/h): 

Ax{t)=xZit)-x:P^{t). (2) 

From the distribution of the estimation errors Ax{ti), i ^ 1 . . . n, the mean devi- 
ation (Ax) of the estimated jam front is defined by averaging over all predicted 
positions. Furthermore, the variation <Tx is a relevant measure for its reliability. 

Figure 5 shows the resulting mean deviation (Ax) and the variation ax of 
the upstream jam front as a function of percentage of equipped vehicles for 
the instantaneous (GSM) and the delayed (WLAN) communication modes. The 
'bands' are calculated from single simulation runs (shown as single points) by 
a local linear regression method [13]. Both modes show the same cross-over 
behavior: Below a fioating car fraction of about 1.2%, the estimation errors 
increase with decreasing equipment level. Above 1.2%, the prediction errors 
stay approximately constant indicating a diminishing benefit from additional 
(and thus redundant) information provided by fioating cars for the chosen value 
of A. For an equipment level of 1.2 %, the average time gap between two passing 
vehicles is about 1 min. In this case, the prediction error is about (—36 ± 400) m 
for the GSM mode and (320 ±850) m for the WLAN mode. The large systematic 
errors in the prediction in the local information transmission is due to the large 
delays of up to 10 min before equipped vehicles can broadcast the information to 
the RSU located downstream the traffic jam. For a local vehicle-infrastructure 
communication, this dominating time delay can only be compensated for by 
increasing the number of RSU along the road. 



4 Model-based Estimation using Additional Flow Data 

The online detection of the upstream jam front solely from the speed informa- 
tion of floating vehicles is only possible with prediction errors particularly when 
considering a local vehicle-infrastructure communication by wireless interfaces 
to only one RSU located downstream a bottleneck. In order to improve the es- 
timation quality, we additionally consider flow data from stationary detectors. 
Since the number of vehicles is a conserved quantity, model-based approaches 
using the continuity equation can be applied. A consequence of continuity is the 
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Fig. 5 Mean error and variation of estimated upstream jam front position. The percentage of 
equipped vehicles is systematically varied. 



well-known propagation equation for shock fronts 

_ dXfront _ Q2 - Ql /„s 

'™* dt Pcong((32) - Pfroc(Ql) ' 

Thus, the propagation speed of the upstream jam front can be calculated if the 
flows Ql and Q2 of either side of the jam front are known (see left sketch in 
Fig. 6), and if one additionally assumes a fundamental diagram, i.e. a functional 
relation between traffic flow Q and density p. As a simple model, we consider a 
'triangular' fundamental diagram (see right plot in Fig. 6) with only two propa- 
gation velocities: (i) Vq is the speed in free traffic conditions and corresponds to 
the mean travel speed of the vehicles (with a value of the order of lOOkm/h. (ii) 
In contrast, perturbations in congested traffic propagate against the direction of 
travel with a (remarkably constant) velocity of about Vg = — 15kni/h. By using 
both velocities, the (flow) information measured at the detector positions x^p 
and Xdown can be connected to the current upstream jam front position Xfront: 

Ql{t) = Qfroo(a;front,i) = Qup (^t - '^^'""^^ ^"'^ ^ , (4) 

Q2{t) = Qcong (affront, i) = Qdown \t— ' ^"'^"i ^'""^ ) • (5) 

V \Vg\ J 

Interestingly, both detector measurements are requested at past times which 
qualifies this approach for an online application in real-time. 

In Fig. 1, the proposed prediction model has been applied to the empirical 
traffic data using the calibrated parameters T = 1.8 s (time gap), Pmax = 80 /km 
(maximum density), and Vq — 120km/h (desired free speed). Here, the starting 
point for the integration of Eq. (3) has been set manually. In combination with 
available FCD, this procedure can be automated because the first floating car 
that detects the traffic breakdown triggers the integration. Furthermore, the 
current position of the upstream jam front Xf].ont(i) detected by a floating car 
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Fig. 6 Left: Sketch of the considered detectors located up- and downstream of the traffic jam. 
Right: Assumed flow-density relationship {fundamental diagram) with two propagation speeds 
only. 

at (a past) time t can be used for an automated correction of the prediction 
deviations which are summing up by errors in the jam front velocity Cfront(0- 
Furthermore, the deviations can be used as input to an onhne cahbration of the 
model parameters. Whenever the jam front position has been reset at time t, the 
ODE (3) has to be integrated again up to the actual time t' > t. Figure 7 shows 
the prediction results of the upstream jam fronts for FC equipment percentages of 
0.1% and 0.5% using the uncalibrated parameters T ~ 2.0 s, Pmax — 100 /km, 
and Vo = lOOkm/h. For the purpose of illustration, the jam front prediction 
without automated FC correction is plotted as thin dashed lines. 

Finally, we assess the prediction accuracy by means of the measure (2) in 
Fig. 8. Compared to the results shown in Fig. 5, the estimation model based 
on the continuity equation is more robust than the prediction based solely on 
floating cars. Note that this robustness is a prerequisite in an online applica- 
tion. Already for very low equipment percentages of the order of 0.1%, mean 
deviations are about 200 m with standard deviations of 400 m for both FC com- 
munication modes. 



5 Discussion and Conclusions 

Car-to-infrastructure communication offers interesting potentials for driver in- 
formation services which are more detailed and more up-to-date than today's 
TMC messages and/or digital maps. For instance, infrastructure-related mes- 
sages, e.g., about a temporary workzone, can be transmitted by a road-side unit 
(RSU) via local wireless communication devices. However, reliable information 
about the dynamic traffic situation requires up-to-date traffic data and robust 
prediction models. In this paper, we have considered a vehicle-based approach to 
collect traffic data and use the data to estimate the upstream and downstream 
fronts of a traffic jam. A small percentage of probe vehicles records their trajec- 
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Fig. 7 Estimation of upstream jam front using a combination of speed data from floating cars 
(0.1 % and 0.5 %) and flow data (1-min aggregation interval) from stationary detectors located 
up- and downstream the traffic jam. 




Fig. 8 Mean error and variation of the estimated upstream jam front position for the 
continuity-equation based approach using flow data from stationary detectors in combination 
with floating car data. 



tories and speeds to transmit the data to a RSU. We have shown that the delays 
resulting from a local communication to a single RSU positioned downstream 
the bottleneck will prevent a reliable jam front estimation based on floating- 
car data only. Obviously, the delay time can be reduced by using more road-side 
units along the roadway so that probe vehicles can transmit their data in shorter 
intervals. 
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Furthermore, we considered an alternative prediction approach using floating 
car data in combination with flow data collected by stationary detectors, which 
allows to apply the propagation equation of shock waves. Since probe vehicle 
data can be used for an automatic correction of the estimates and for an online 
calibration of the model parameters, this is a robust and promising concept 
which is effective already for equipment percentages of probe vehicles of a few 
per mill. Moreover, it is suited for a real-time application in practice. 

Travel times and time losses due to congestion are also relevant dynamic 
quantities for local driver information services. Travel times can directly be de- 
rived from the probe vehicle data. However, this estimation does not allow to 
predict travel times which becomes relevant when no probe vehicle passes the 
RSU for a while. Alternatively, travel times can be calculated in a more robust 
way from cumulative vehicles counts. Again, probe vehicles can be used to reset 
the accumulated counting errors. 
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